Method and apparatus for internet protocol voice based telephony communication

ABSTRACT

A method of providing a sound file to a voice gateway for coupling the voice gateway to a receiver via a voice based telephony connection. The method includes the steps of establishing a connection to the voice gateway, dividing the sound file into smaller files with a reduced size, altering the sound file header to reflect the smaller files size. The altered file header is attached at the beginning of the smaller files. The smaller files are converted into a format usable by the voice gateway. The converted smaller files are then coupled through to the voice gateway, for playback by a user.

BACKGROUND

[0001] The invention relates, generally, to Internet based media, and more particularly to novel methods and apparatus for providing sound files to voice gateways for coupling to voice based telephony channels.

[0002] Audio signals are available through the Internet typically in the form of streaming broadcast of radio, recorded music, etc. These audio signals are typically downloaded from an Internet signal source via the user's Internet service provider through a modem to a computer. Internet media includes traditional broadcast services, such as radio stations in a multitude of content types (news, sports, music, etc . . . ). An increasing number of such stations have been establishing their own websites. These websites can be located using conventional web browsers to request specific audio files from either live or recorded audio programs. Internet radio systems make it possible to download an increasingly diverse selection of audio programs, including specific specialized programs not available from conventional broadcast media.

[0003] However the use of a visual web browser to seek, locate, and then play desired programs is currently unsuitable for use by a land line or cellular phone receiver. Thus, the majority of all media on the Internet is not available over the phone channels (land line or cellular). Such phone communication is complicated by the fact that each media type typically uses a different format. For example, two such media are MP3 and RealAudio files, each having a different specific format. Voice Gateways are commonly used to bridge the gap between the voice telephony channel and the Internet and they typically only utilize one type of audio file format that is not the format of the Internet media. Typically, in the U.S. the common format is μlaw (also commonly written as Mulaw). The problem is that the formats used by the Voice Gateways are incompatible with the format typically used by the Internet media. Thus conversion is required to couple many of the Internet media over a voice telephony connection.

[0004] A need, therefore, exists for a solution that provides a user access to the full variety of Internet media including such sources as MP3 and streaming RealAudio files to the on-the-move user via a voice based telephony connection.

SUMMARY OF INVENTION

[0005] In one embodiment a method of providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection is described in which a connection to the voice gateway is established and the original sound file is divided into smaller files. The original sound file header is also altered to reflect the smaller file size and then is attached to the smaller files. The smaller files are converted into a format usable by the voice gateway, and the converted smaller files are provided to the voice gateway for playback to the user.

BRIEF DESCRIPTION OF DRAWINGS

[0006] The invention, together with the advantages thereof, may be understood by reference to the following description in conjunction with the accompanying figures, which illustrate some embodiments of the invention.

[0007]FIG. 1 is a functional block diagram illustrating one embodiment of a communication network connecting the Internet to a telephony receiver.

[0008]FIG. 2 is a functional block diagram illustrating a specific embodiment of a method of providing a sound file to a voice gateway for coupling to a voice based telephony connection.

[0009]FIG. 3 is a flow chart illustrating one embodiment of a method of connecting to an audio source, converting an audio file, and providing the resulting usable file to a voice gateway.

[0010]FIG. 4 is a graph illustrating a specific example of an algorithm for a predefined reduced file size.

[0011]FIG. 5 is a graph illustrating a specific example of an algorithm for a linear growth of the reduced file size.

[0012]FIG. 6 is a graph illustrating a specific example of an algorithm for a logarithmic growth of the reduced file size.

[0013]FIG. 7 is a graph illustrating a specific example of an algorithm for a linear reduction of the reduced file size.

[0014]FIG. 8 is a graph illustrating a specific example of an algorithm for a logarithmic reduction of the reduced file size.

DETAILED DESCRIPTION

[0015] While the present invention is susceptible of embodiment in various forms, there is shown in the drawings and will hereinafter be described some exemplary and non-limiting embodiments, with the understanding that the present disclosure is to be considered an exemplification of the invention and is not intended to limit the invention to the specific embodiments illustrated.

[0016] In this application, the use of the disjunctive is intended to include the conjunctive. The use of definite or indefinite articles is not intended to indicate cardinality. In particular, a reference to “the” object or “a” object is intended to denote also one of a possible plurality of such objects.

[0017]FIG. 1 is a functional block diagram illustrating one embodiment of a communication network 10 connecting the Internet 12 or any other suitable network for example an Extranet or Intranet, etc . . . to a receiver, landline, or wireless device 18. The communication network 10 allows an Internet user to make a voice channel connection to the Internet 12 to receive audio communication such as MP3 audio, streaming Real Audio, broadcast radio etc . . . over a voice based telephony communication channel such as PSTN (Public Switch Telephone Network) phone connection, a VoIP (Voice over Internet Protocol) connection, etc. The communication network 10 comprises the Internet network 12 coupled to a voice gateway 14, which is coupled to a PSTN 16, as shown. The PSTN 16 couples the audio communication to a suitable voice receiver 18 such as a landline telephone or wireless phone receiver 18. An application server 22, such as the Oracle 9iAS is coupled, as shown, to the Internet 12 and/or to the voice gateway 14.

[0018] The Internet 12 allows a user, using a personal computer or any other Internet capable device, the ability to access many types of information from internet-connected sources. The desired information is maintained for example in web pages delivered upon request from servers. Some web pages are maintained by broadcast services that offer audio programs, both live and archived. These audio programs may be accessed through hyper-links, used, for example, to open a specific part of the web page or another page on the same server. In accessing a hyper link, a connection to a server may be established to select and download a specific live or archived broadcast. The downloaded broadcast could be played on an appropriate electronic device.

[0019] Referring to FIG. 1, the voice gateway 14 provides an audio communication bridge between the PSTN 16, and the Internet service provider 12 using a standard format such as a μlaw format. A voice gateway can support a choice of Automated Speech Recognition (ASR), Text-To-Speech (TTS) technologies, browser functionalities (e.g. for connecting to the Internet using standard TCP/IP (Transmission Control Protocol Internet Protocol) protocols to interpret voice data in formats such as Voice XML, SALT, etc . . . and telephony technologies (e.g. to play audio files over the phone). Typically voice gateways can be scaled to accommodate voice applications of any size, and are capable of supporting millions of users.

[0020] The PSTN 16 may include a wireless telecommunication network that provides public switched telephone network connectivity, control functions, and switching functions for wireless users, as well as connectivity to voice gateways.

[0021] The application server 22, such as an Oracle 9iAS, runs a Web site, or Internet application. The application server 22 also allows a Website or an application to be accessible from any browser or mobile device.

[0022] This communication network 10 may incorporate a land or wireless communication system to provide a requested program to a landline or wireless phone user. The requested audio program, accessed on the Internet 12, is sent to the voice gateway if it is already provided in a format usable by the voice gateway 14 (e.g. μlaw). Otherwise the requested audio may first be converted to render it suitable for coupling to the voice gateway 14. The voice gateway 14 would then transmit the converted file's audio content to the user, e.g. via a PSTN 16. This conversion involves dividing the original file to a suitable sized file. The reduced size files are then converted to a format compatible with the voice gateway 14, and coupled through the voice gateway 14 to the PSTN 16 and then to the reciever 18.

[0023] Another embodiment of a communication network 10 connecting the Internet 12 to a receiver, landline, or wireless device 18, may be used to deliver audio files over the phone using the following 3-step process:

divide->convert->deliver

[0024] The “divide” segment of the process involves establishing a link with the server on the Internet that holds the MP3 or RealAudio files. Once the link is established the downloading of the file begins. After an initial portion of an audio file is downloaded, it is stored into a separate file, with a modified header attached which is similar to the original header but represents the new file size.

[0025] The “convert” segment of the process begins with the downloaded reduced size file while simultaneously the rest of the original file is being further divided and downloaded into separate storage files. The conversion segment takes the file that has been created by the divide segment and runs it through a series of converters to get it into the appropriate format for the voice gateways. Once the file is converted it is stored into a final file for delivery.

[0026] The “deliver” segment of the process involves informing the voice gateway of all the files that are (and will) be available once the entire “divide->convert->deliver” process is completed.

[0027]FIG. 2 is a functional block diagram 100 of a specific embodiment of a method for providing a sound file to a voice gateway 106 for coupling through the voice gateway 106 to a receiver (not shown) via a voice based telephony connection. A connection is made to an Internet file having a format that is not in a format required by the voice gateway 106 or having a size too large to play immediately and hence would benefit from division. This file can be located on the Internet 102 or on a local or source machine 110, such as a personal computer. Once a successful connection is made, the gateway 106 is informed which files will be produced at the end of the conversion process. This information also may inform the gateway 106 to wait for a predefined period of time before playing the first file, while the original source file is subdivided into files and converted into a format suitable for the gateway 106. This pause is desirable since the system requires a certain amount of time to produce the first converted file.

[0028] Most original files have a source header 112 typically at their beginning. This source header 112 defines the file parameters such as the contents of the file, its size, format, sampling rate, number of channels, bits per sample, etc. Since the original file is to be divided into smaller sized files, the source header is saved so that the smaller files have the same description as their original file source. But the source header 112 in the illustrated embodiment may also be processed by a converter in process 114 to reflect the division of the original file into smaller ones. The primary change to the source header 112 is in the portion of the header defining the size of the file. Thus, a new, converted, header 116 is created defining a smaller file size and the converted header is saved. A new file 118 is then created and the new header 116 is affixed to it (e.g. at the beginning). In the illustrated embodiment of FIG. 2, data from the source is downloaded to fill the new file 118 with data. The desired size of the new reduced size may be predetermined or be determined based on any algorithm suitable to the type of data format used by the original file. Once the predefined new file size is reached, two new processes begin. They are the conversion and the download processes.

[0029] First, the new reduced file is sent to a conversion routine 122 to be converted to the format usable by the gateway 106. And, since the downloading of data can continue during conversion a subsequent reduced size file 126 can be created and filled while conversion proceeds. As the conversion process takes place on the first file 118, the download process is taking place for the next file 126. Once the conversion process 122 is completed, the converted file 124, which is now in a usable format, is made available to the gateway 106.

[0030] When the next file 126 is complete and ready for conversion 122, a new file 134 is prepped for filling, and the process repeats converting files 126 while filling the next files 134. This parallel processing is used to ensure that no time is wasted receiving the file and converting the file. The process continues, filling, converting and transmitting subsequent reduced files until the end of the data from the original source file is reached. Once the last file with data from the original source is done downloading, the original files are removed so that system space is conserved.

[0031] The above method may be performed by the application server 22 (such as a Oracle 9iAS) or some other processor using instructions which may reside on a computer-readable medium. The computer-readable medium may be any suitable computer readable storage medium such as, but not limited to random access memory, read-only memory, flash memory, CDROM, DVD, solid-state memory, magnetic memory, and optical memory.

[0032] In an embodiment in which the connection is streaming, e.g. live-radio broadcast, the system may perform a different method of file management. This method involves overwriting old files (which have already been played by the gateway) with new information. After a predefined number of files are created, the next new file to be created is actually the first file that was created when the process began. For example, in the embodiment of FIG. 2, file 118 could be filled, followed by file 126, and file 134. Then, while file 126 is coupled to the voice gateway after conversion and while file 136 is being converted, new data may be downloaded into file 118. The process then continues cycling through the available files by converting each in sequence. This method conserves system space and improves efficiency. The voice gateway 106 is also informed of this overwriting process and is instructed to get new copies of the converted files instead of using existing copies of the old ones. While this method is highly suitable for a streaming connection, it can be used with the static connections as well.

[0033]FIG. 3 is a flow chart 200 detailing the steps of converting an audio file and providing the resulting converted file to a voice gateway. A connection is initiated, from a user, to an Internet audio source as illustrated at 202. The processing branches at 204 to an error script 206 if not successful. Once the connection is established as shown at step 208, then a play list 212 is supplied to a voice gateway informing the voice gateway what files will be coupled to it and then the audio source header is then retrieved as shown at 214.

[0034] Once the source header has been retrieved at 214, the conversion of the header begins 216. This conversion is primarily used to modify the portion of the header related to the file size to be used for the reduced size files when the original source file is subdivided into new smaller files. The new header reflects the new predetermined size of the new reduced size files. The new header is stored as indicated at step 218 so that it can be to be affixed to the beginning of each of the new reduced size files.

[0035] Once the new reduced size file is created at step 222 and the new header 220 is affixed to it, the process of filling the new file with data commences as illustrated by step 224. When the file is full, the conversion of the new file to a format usable by the voice gateway is performed as illustrated by step 226. The converted file is coupled to the voice gateway and the process continues at 228. The processes of building a subsequent file and converting the preceding file may therefore be performed in parallel, 222, 226. This parallel process continues until the end of the original audio source file is reached.

[0036] The size of the smaller size files can be either static or dynamic as determined by appropriate algorithms. The use of a predefined file size, i.e. static algorithm, is particularly useful for when the playback time of the file takes about as long as the download and conversion processes of subsequent files. FIG. 4 illustrates a graph of an example of predefined file size. Dynamic algorithms are desirable in many situations. For example, in a case where the file playback time takes longer than the download and conversion processes, then the method of providing sound files to the voice gateway may spend more time downloading and converting the files as time goes on. In such situations it can be important to start playing the initial part of the file for the user so that subsequent files can be downloaded. Many size-determining algorithms are suitable. For example, FIG. 5 illustrates an example of linearly increasing file size, while Fig.6 illustrates an example of logarithmically increasing file size. In another example where the file playback time is shorter than the time necessary for both the download and the conversion processes, it may be desirable to reduce the file size as time goes on. A minimum file size will eventually be reached using this method to avoid eventually reaching a file size of zero.

[0037]FIG. 7 illustrates an example of one possible algorithm for linearly decreasing file size and FIG. 8 illustrates an example of logarithmically decreasing file size.

[0038] In another embodiment, where the audio source is a very large audio file or a streaming feed, the process overwrites the oldest files with new data to create the new subsequent files.

[0039] Specific embodiments of a method and apparatus for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, according to the present invention, have been described for the purpose of illustrating the manner in which the invention is used. It should be understood that the implementation of other variations and modifications of the invention and its various aspects will be apparent to one skilled in the art, and that the invention is not limited by the specific embodiments described. Therefore, it is contemplated to cover the present invention any and all modifications, variations, or equivalents that fall within the true spirit and scope of the basic underlying principles disclosed and claimed herein. 

1. A method of providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, comprising the steps of: dividing the sound file into smaller files; altering the sound file header to reflect the smaller files size, and attaching it to the smaller files; converting the smaller files into a format usable by the voice gateway, and providing the converted smaller files to the voice gateway.
 2. The method of providing a sound file to a voice gateway as in claim 1 wherein the sound file is located on an Internet network or on a local Internet-connected electronic device.
 3. The method of providing a sound file to a voice gateway as in claim 1 wherein the voice gateway is provided with information regarding the files to be coupled via the connection.
 4. The method of providing a sound file to a voice gateway as in claim 1 wherein the voice gateway is requested to wait for a predetermined period of time before playing a first one of the smaller size files to be coupled via the connection.
 5. The method of providing a sound file to a voice gateway as in claim 1 wherein at least one smaller size file is filled with data while at least one other smaller size file is simultaneously converted into the usable format.
 6. The method of providing a sound file to a voice gateway as in claim 1 wherein the format of sound files usable by the voice gateway is μlaw.
 7. The method of providing a sound file to a voice gateway as in claim 1 wherein the sound file is divided into smaller size files suitable for the voice gateway.
 8. The method of providing a sound file to a voice gateway as in claim 7 wherein a predefined file size is used whenever the playback speed of the reduced size files takes about as long as the download and the conversion processes.
 9. The method of providing a sound file to a voice gateway as in claim 7 wherein the division of the original sound file into smaller size files performed in accordance with an appropriate algorithm chosen to timely balance the playback of a reduced size file and the download and the conversion of the subsequent files.
 10. The method of providing a sound file to a voice gateway as in claim 9 wherein the appropriate algorithm used to divide of the original sound file into smaller size files results in one of a linear growth, a linear reduction, a logarithmic growth, and a logarithmic reduction of the size file of all subsequent files.
 11. A method of providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, comprising the steps of: dividing the sound file to form smaller sized files converting a first one of the smaller size file to a format suitable for the voice gateway, and downloading data simultaneously from the sound file into a subsequent smaller size file.
 12. The method of providing a sound file to a voice gateway as in claim 11 wherein the converted smaller size file is provided to the voice gateway once the conversion is completed.
 13. The method of providing a sound file to a voice gateway as in claim 11 wherein an old converted smaller size file is overwritten with new data while building a new file to conserve system space in an application server.
 14. The method of providing a sound file to a voice gateway as in claim 13 wherein the voice gateway is informed of the overwriting process, so that new converted files are provided to the user instead of the old ones.
 15. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, comprises: division means for reducing the sound file into smaller files; altering means for converting the sound file header to reflect the smaller files size, and attaching it to the smaller files; conversion means for rendering the smaller files usable by the voice gateway, and communication means for providing the converted smaller files to the voice gateway.
 16. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 15 wherein communication means provide the voice gateway with information regarding the files to be coupled via the connection.
 17. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 15 wherein communication means overwrites old converted files with new data to build a new smaller size file.
 18. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 15 wherein conversion means processes the sound file header using a selected algorithm to divide the sound file into smaller size files to provide one of predefined file size, increasing file size, and decreasing file size.
 19. A computer-readable medium having computer-executable instructions for performing a method of providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, the method comprising the steps of: dividing the sound file into smaller files; altering the sound file header to reflect the smaller files size, and attaching it to the smaller files; converting the smaller files into a format usable by the voice gateway, and providing the converted smaller files to the voice gateway.
 20. The computer-readable medium having computer-executable instructions for performing the method of providing a sound file to a voice gateway, as in claim 19 wherein the step of dividing comprises using a selected algorithm to divide the sound file into smaller size files to provide one of predefined file size, increasing file size, and decreasing file size.
 21. The computer-readable medium having computer-executable instructions for performing the method of providing a sound file to a voice gateway, as in claim 19 wherein the step of providing comprises overwriting an old converted smaller size file with new data to build a new file.
 22. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, comprises: at least one processor programmed to reduce the sound file into smaller files, convert the sound file header to reflect the smaller files size, attach the converted sound file header to the smaller files, and render the smaller files usable by the voice gateway; and communication channel coupled to the processor to provide the converted smaller files to the voice gateway.
 23. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 22 wherein communication channel provides the voice gateway with information regarding the files to be coupled via the connection.
 24. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 22 wherein the at least one processor overwrites old converted files with new data to build a new smaller size file.
 25. A communication system for providing a sound file to a voice gateway for coupling to a receiver via a voice based telephony connection, as in claim 15 wherein the at least one processor processes the sound file header using a selected algorithm to divide the sound file into smaller size files to provide one of predefined file size, increasing file size, and decreasing file size. 